Support for proofs of null/absence. Dried up prove/verify. #82

zmitton · 2019-02-14T22:54:59Z

The best data structure for a proof is actually a merkle-patricia-tree itself. This tree can be built by batching all the node values of the proof into the underlying db, each keyed by its keccak hash. This optimizes multiple proofs, because nodes they share are stored only once. Verification then consists of simply using the tree you built as if it were the real merkle-patricia-tree. You can do any operation on it that you would normally do on the main one. The only subtlety is that if, at any point during traversal, it tries to “step in” to a hash value that it can't find in the underlying db, the proof is missing pieces and is invalid.

This will correctly handle null leaves. When performing get() on a key whose value is null, traversal ends at the target slot (one of the node's 17) that contains an empty byte array. This means the key corresponds to null. You can even still do put, because you will again arrive at an empty byte array and, knowing that nothing further down that path was initialized, can put the extension node for the new value and then hash each node back up to the root as usual.

The current serialization format will work fine, but it can now include multiple proofs with nodes in any order, except that the root node should always be at index 0.
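
The idea above can be sketched with a toy model. This is not the library's code: nodes are JSON with one child per key character (an assumption made for brevity, instead of the real 17-slot branch nodes), and SHA-256 from Node's `crypto` stands in for keccak so the sketch has no dependencies. The names `fromProof` and `get` mirror the description, but their signatures here are illustrative only.

```javascript
const crypto = require('crypto')

// Stand-in for keccak256 (an assumption that keeps the sketch dependency-free).
const hash = (buf) => crypto.createHash('sha256').update(buf).digest('hex')

// Toy node: JSON with a `children` map (one key character -> child hash) and a `value`.
const encode = (children, value) => Buffer.from(JSON.stringify({ children, value }))

// Batch every proof node into a db keyed by its own hash; duplicates collapse.
function fromProof (proofNodes) {
  const db = new Map()
  for (const node of proofNodes) db.set(hash(node), node)
  return db
}

// Walk the sparse trie exactly as one would walk the full trie.
function get (db, rootHash, key) {
  let ref = rootHash
  for (const ch of key) {
    if (!db.has(ref)) throw new Error('Proof is missing node ' + ref)
    const node = JSON.parse(db.get(ref).toString())
    if (node.children[ch] === undefined) return null // empty slot: key is provably absent
    ref = node.children[ch]
  }
  if (!db.has(ref)) throw new Error('Proof is missing node ' + ref)
  return JSON.parse(db.get(ref).toString()).value
}

// Full trie holds "a" -> 1 and "b" -> 2; the proof for "a" omits the "b" leaf.
const leafA = encode({}, 1)
const leafB = encode({}, 2)
const root = encode({ a: hash(leafA), b: hash(leafB) }, null)
const db = fromProof([root, leafA, leafA]) // a repeated node is stored once
```

With this db, looking up "a" yields its value, looking up "c" yields null because the root has no slot for it, and looking up "b" throws because the proof never supplied that leaf.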

zmitton · 2019-02-15T00:20:26Z

fixes #64

coveralls · 2019-02-15T00:23:58Z

Coverage increased (+0.7%) to 93.761% when pulling 2f80f33 on zmitton:null-proof into 88f729d on ethereumjs:master.

zmitton · 2019-02-15T00:24:38Z

fixes #47

zmitton · 2019-02-15T00:49:50Z

fixes #19

zmitton · 2019-02-15T01:29:40Z

Fixes #17

holgerd77 · 2019-02-15T09:51:27Z

Hi @s1na, can you take a look at this, think you are actually deeper into the library than me? This will probably fail due to the Tape issue after merge on another CI run on master. Think it should nevertheless be ok to merge (if you are ok with it) and then directly merge-in #81 to have this fixed.

holgerd77 · 2019-02-15T09:54:35Z

@zmitton 4 fixes by one PR, that sounds at least extremely tempting! 😄 Thanks for the PR!

zmitton · 2019-02-15T19:38:48Z

Looks like it 👍. That last one was completely accidental

zmitton · 2019-02-16T21:06:07Z

src/baseTrie.js

@@ -49,6 +49,38 @@ module.exports = class Trie {
    this.root = root
  }

+  static fromProof(proofNodes, cb, proofTrie){


What you are making here could be called a sparse trie. The optional proofTrie argument lets you build onto an existing sparse trie if you want.

zmitton · 2019-02-16T21:08:47Z

src/baseTrie.js

@@ -49,6 +49,38 @@ module.exports = class Trie {
    this.root = root
  }

+  static fromProof(proofNodes, cb, proofTrie){
+    let opStack = proofNodes.map((nodeValue) => {
+      return {type: 'put', key: ethUtil.keccak(nodeValue), value: ethUtil.toBuffer(nodeValue)}


It's kind of like the verification is done in this step, because when you traverse the tree you already know each key is the hash of the node it's storing.
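
A minimal sketch of why keying by hash doubles as verification. SHA-256 stands in for `ethUtil.keccak` (an assumption to avoid dependencies), and `toOpStack` is a hypothetical helper that only mimics the shape of the `opStack` in the diff.

```javascript
const crypto = require('crypto')

// Stand-in for keccak256 (assumption; the PR uses ethUtil.keccak).
const hash = (buf) => crypto.createHash('sha256').update(buf).digest('hex')

// Same shape as the opStack in the diff: one `put` per proof node, keyed by its hash.
const toOpStack = (proofNodes) =>
  proofNodes.map((node) => ({ type: 'put', key: hash(node), value: node }))

const honestLeaf = Buffer.from('balance=10')
const forgedLeaf = Buffer.from('balance=99')
const parentRef = hash(honestLeaf) // what the parent node commits to

// A forged node lands under a different key, so the parent's reference cannot reach it.
const ops = toOpStack([forgedLeaf])
const reachable = ops.some((op) => op.key === parentRef)
```

Here `reachable` is false: the forged node is stored, but nothing that starts from the trusted root can ever step into it.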

zmitton · 2019-02-16T21:18:58Z

src/baseTrie.js

@@ -434,7 +467,8 @@ module.exports = class Trie {
          childKey.push(childIndex)
          const priority = childKey.length
          taskExecutor.execute(priority, taskCallback => {
-            self._lookupNode(childRef, childNode => {
+            self._lookupNode(childRef, (e, childNode) => {
+              if(e){ return cb(e, null)}


This is the line that makes my tests pass, but I added the same check everywhere _lookupNode is called (except in checkRoot, which I believe is expected to just return false in that case).

So probably it could use some more tests that make sure it's properly propagating the other errors (but current tests continue to pass).
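
The propagation pattern being described can be sketched as follows. The in-memory `db`, `lookupNode` and `getChild` are hypothetical stand-ins, not the library's internals; only the `if (e) return cb(e, null)` guard is taken from the diff.

```javascript
// Hypothetical in-memory db; the root points at a child that was never stored.
const db = new Map([['root', { child: 'missing' }]])

// Err-first lookup: failures are passed to the callback rather than thrown.
function lookupNode (ref, cb) {
  if (!db.has(ref)) return cb(new Error('Missing node: ' + ref), null)
  cb(null, db.get(ref))
}

// Every caller forwards the error and stops, which is the guard added in the diff.
function getChild (ref, cb) {
  lookupNode(ref, (e, node) => {
    if (e) return cb(e, null)
    lookupNode(node.child, (e2, child) => {
      if (e2) return cb(e2, null)
      cb(null, child)
    })
  })
}

let result
getChild('root', (err, value) => { result = { err, value } })
```

A test for each call site would look much like the last two lines: trigger a missing node and assert that the outermost callback receives the error.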

zmitton · 2019-02-16T21:21:39Z

test/index.js

  it('should not crash if given a non-existant root', function (t) {
    var root = new Buffer('3f4399b08efe68945c1cf90ffe85bbe3ce978959da753f9e649f034015b8817d', 'hex')
    var trie = new Trie(null, root)

    trie.get('test', function (err, value) {
      t.equal(value, null)
-      t.end(err)
+      t.notEqual(err, null)


Note: the behavior here has changed slightly, but reading the objective of the test I think it's OK.

s1na

This is cool! I like the general approach. I suggest we break this into two PRs, one for modifying the _lookupNode behaviour (to make sure it has no unintended consequences), and one for the proofs.

zmitton · 2019-02-18T16:34:18Z

OK yeah, I can do that. And I might add some more tests before I re-submit the latter.

holgerd77 · 2019-02-26T10:20:33Z

Any update on this?

zmitton · 2019-02-26T16:34:53Z

As requested by @s1na I've put the fix for #17 in its own PR. It might be another week or so before I submit the other half separately, because I would like to test a few more angles from my own repo, eth-proof. This is not so easy because I want to build specific configurations of trees to hit the edge cases.

holgerd77 · 2019-03-01T14:41:07Z

Just for mental preparation 😄: this will need a rebase and some squashing of commits once ready. Is it necessary that you do this on top of the other code changes (I haven't looked deeper into the code)? It will probably be a bit difficult to get this out again once the other PR is merged.

zmitton · 2019-03-18T01:49:17Z

ok, I have squashed all the commits into 1 per pull-request. Both should be ready to go

holgerd77 · 2019-03-19T06:20:46Z

Ah, this would need to have the commit from the merged PR removed. It also needs a rebase.

holgerd77 · 2019-03-19T06:21:22Z

(also: linting is currently failing, run npm run lint before pushing)

zmitton · 2019-03-19T20:08:09Z

I'm not sure what you mean. This PR won't work without 0cd83dc underneath it.

Can you describe the exact steps you would like done, or maybe just cherry-pick them in or something?

holgerd77 · 2019-03-19T20:56:34Z

The _lookupNode callback PR #83 has now been merged, therefore the respective commit should be removed from this PR, otherwise it is applied twice (with git rebase -i master from your branch, and then using drop on the commit).

Then git rebase master has to be applied to get the branch on top of the latest master state, and finally you should run npm run lint, fix the linting errors and add the fixes (possibly also with git rebase -i master, squashing the lint-fix commit into the main work commit).

Does this help? Did I overlook something?

holgerd77 · 2019-04-10T09:06:49Z

Hi @zmitton do you find some time to update this here? I would love to merge and do a release! 😀

holgerd77 · 2019-04-29T13:18:50Z

@zmitton This is not to get on your nerves 😄, just a gentle reminder in case this gets forgotten: if you find some time, an update would be appreciated, especially since we are really close to the finish line. 🎯

zmitton · 2019-05-03T05:01:34Z

@holgerd77 luckily i didn't have to do any of the fancy stuff. Just a normal rebase from master. I believe it achieves the desired result. Let me know :)

lgtm-com · 2019-05-03T05:03:53Z

This pull request fixes 1 alert when merging 73e7b6d into 88f729d - view on LGTM.com

fixed alerts:

1 for Useless assignment to local variable

Comment posted by LGTM.com

s1na

Overall I think it's a really interesting idea, and if it actually works I'm sure it'll have its use-cases.

I'd feel more confident if there was any other client which also had proof of absence, as I'm not sure if there are no edge cases.

In addition to the comments below, I tried to integrate this branch into ethereumjs-vm which failed (for reasons not related to this PR).

s1na · 2019-05-03T09:47:39Z

src/baseTrie.js

+  static prove (trie, key, cb) {
+    trie.findPath(key, function (err, node, remaining, stack) {
+      if (err) return cb(err)
+      let p = stack.map((stackElem) => { return stackElem.serialize() })


You're including embedded nodes (specifically those with length < 32) in the proof in comparison with the previous prove method. Is that necessary for the non-existence proofs?
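
For context on the question, a small sketch of the embedding rule in the trie format: a child whose serialization is shorter than 32 bytes is stored inline in its parent instead of being referenced by hash. `childReference` is a hypothetical helper, and SHA-256 stands in for keccak256 (both produce 32-byte digests).

```javascript
const crypto = require('crypto')

// Stand-in for keccak256 (assumption); like keccak256 it yields 32 bytes.
const hash = (buf) => crypto.createHash('sha256').update(buf).digest()

// A parent refers to a child by hash only when the serialized child is 32 bytes
// or longer; shorter children are embedded in the parent as-is.
function childReference (serializedChild) {
  return serializedChild.length < 32 ? serializedChild : hash(serializedChild)
}

const small = Buffer.alloc(10, 1)
const large = Buffer.alloc(40, 1)
const smallRef = childReference(small)
const largeRef = childReference(large)
```

Because an embedded node already travels inside its parent, it is never looked up by hash, which is why listing it separately in a proof adds bytes but no information.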

s1na · 2019-05-03T09:48:43Z

src/baseTrie.js

+    })
+  }
+
+  static verifyProof (rootHash, key, proofNodes, cb) {


Shouldn't root of proofTrie be checked against rootHash here?

Actually yeah, that's true. It's hard to say whose job this is. But if it's going to take the argument rootHash it should verify it. Something like this will do it:

  static verifyProof (rootHash, key, proofNodes, cb) {
    let proofTrie = new Trie(null, rootHash)
    Trie.fromProof(proofNodes, (error, proofTrie) => {
      if (error) return cb(new Error('Invalid proof nodes given'), null)
      proofTrie.get(key, (e, r) => {
        return cb(e, r)
      })
    }, proofTrie)
  }
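
A standalone sketch of the same point, using a toy two-level structure rather than the real trie: if traversal starts from the caller's `rootHash`, a proof built against any other root fails on the first lookup. The JSON node layout and the SHA-256 stand-in for keccak are assumptions made for illustration.

```javascript
const crypto = require('crypto')

// Stand-in for keccak256 (assumption).
const hash = (buf) => crypto.createHash('sha256').update(buf).digest('hex')

// Toy proof: the root node is JSON mapping each key to the hash of its value node.
function verifyProof (rootHash, key, proofNodes, cb) {
  const db = new Map(proofNodes.map((n) => [hash(n), n]))
  // Start from the caller's rootHash, not from proofNodes[0]: a proof built for
  // another root then fails at the very first lookup.
  const root = db.get(rootHash)
  if (!root) return cb(new Error('Invalid proof nodes given'), null)
  const ref = JSON.parse(root.toString())[key]
  if (ref === undefined) return cb(null, null) // key absent under this root
  const leaf = db.get(ref)
  if (!leaf) return cb(new Error('Invalid proof nodes given'), null)
  cb(null, leaf.toString())
}

const leaf = Buffer.from('value-1')
const root = Buffer.from(JSON.stringify({ k1: hash(leaf) }))
const proof = [root, leaf]

const results = {}
verifyProof(hash(root), 'k1', proof, (e, v) => { results.good = { e, v } })
verifyProof(hash(Buffer.from('other')), 'k1', proof, (e, v) => { results.bad = { e, v } })
```

The first call succeeds with the stored value; the second reports an error because no node in the proof hashes to the root the caller asked about.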

s1na · 2019-05-03T09:50:50Z

src/baseTrie.js

@@ -135,16 +167,11 @@ module.exports = class Trie {
      cb(null, new TrieNode(node))
    } else {
      this.db.get(node, (err, value) => {
-        if (err) {
-          throw err


db.get doesn't return an error if value is not found (just null for value). I wouldn't remove this if there's no special reason for this, as I can imagine in future db.get could potentially return errors for other reasons (e.g. validation of input).

> db.get doesn't return an error if value is not found

There is a special reason for it. Because of the way this tree is traversed, and because it is a merkle tree, the child node must always exist in a valid tree.

> I can imagine in future db.get could potentially return errors for other reasons

I'm still returning that error below. I'm just not blowing up as before (throwing an error inside an async function was just an unhelpful blow-up).

s1na · 2019-05-03T10:08:07Z

test/proof.js

@@ -37,30 +37,53 @@ tape('simple merkle proofs generation and verification', function (tester) {
          })
        })
      },
-      function (cb) {
+      function (cb) { // should create a valid proof of null


Hm, I wish you had simply added new test cases instead of modifying existing ones. Are these 3 cases not valid anymore? Going through them I have a hard time distinguishing between different result of verifyProof:

If proof is for a key that doesn't exist and verifyProof checks the same key, val and err are null

If proof is for a key and verifyProof checks a totally random key (that wasn't in proof), there'll be error

In the first test case, proof is for key2bb, then it tries to verifyProof for key2, and it again returns null for val and err. I.e. the same proof is also a valid proof of absence for key2, which is certainly interesting, although maybe unexpected?

I'm not sure if users would need to distinguish between all the various states? One benefit of the previous approach in my eyes is that only the value for which the proof was generated would be verified.

> If proof is for a key that doesn't exist and verifyProof checks the same key, val and err are null

The proof should prove the key is associated with null. It returns the correct value (null), and it does not raise an error, because the prove function, having been called on that key, returned sufficient nodes to prove this.

> If proof is for a key and verifyProof checks a totally random key (that wasn't in proof), there'll be error

Since this proof does not contain sufficient nodes to prove the random key, verifyProof(randomKey) will return an error. It does not matter whether randomKey exists or not. It matters that the proof contains the correct nodes to verify it.

> In the first test case, proof is for key2bb, then it tries to verifyProof for key2, and it again returns null for val and err. I.e. the same proof is also a valid proof of absence for key2, which is certainly interesting, although maybe unexpected?

Yes, this was an interesting edge case that I wanted to test. I did not properly document it. In this case, the proof happens to contain enough nodes to prove the value at key2, because traversing into key22 would touch all the same nodes as traversing into key2 (and an extra). So the proof from key22 will be valid to prove the value of any shorter key.

> One benefit of the previous approach in my eyes is that only the value for which the proof was generated would be verified

I don't think that's a benefit, except maybe for testing. It also has a major drawback in that it does not allow proofs to be efficiently combined. This approach allows the "proofTree" to be a drop-in replacement for a "regular tree" (it can even do put). A light client could in the future process state transitions using existing EVM software.
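
The three outcomes discussed in this exchange, plus the combining of proofs, can be reproduced in a toy model. This is a sketch under loud assumptions: one JSON node per key character instead of real branch/extension/leaf nodes, SHA-256 instead of keccak, and a hypothetical `verify` helper that throws instead of using callbacks.

```javascript
const crypto = require('crypto')

// Stand-in for keccak256 (assumption).
const hash = (buf) => crypto.createHash('sha256').update(buf).digest('hex')
const encode = (children, value) => Buffer.from(JSON.stringify({ children, value }))
const load = (db, ref) => {
  if (!db.has(ref)) throw new Error('Proof is missing node ' + ref)
  return JSON.parse(db.get(ref).toString())
}

// Toy lookup over a proof db: one node per key character.
function verify (rootHash, key, proofNodes) {
  const db = new Map(proofNodes.map((n) => [hash(n), n]))
  let node = load(db, rootHash)
  for (const ch of key) {
    if (node.children[ch] === undefined) return null
    node = load(db, node.children[ch])
  }
  return node.value
}

// Full trie: "ab" -> 1 and "x" -> 2.
const leafAB = encode({}, 1)
const nodeA = encode({ b: hash(leafAB) }, null)
const leafX = encode({}, 2)
const root = encode({ a: hash(nodeA), x: hash(leafX) }, null)
const rootHash = hash(root)

const proofAB = [root, nodeA, leafAB] // nodes on the path to "ab"
const proofX = [root, leafX] // nodes on the path to "x"
```

In this model `proofAB` verifies "ab", also proves that the shorter key "a" holds null (its path is a prefix of the same nodes), and throws for "x" because that leaf is not in the proof. Concatenating `proofAB` and `proofX` yields one proof that answers all three keys.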

lgtm-com · 2019-05-03T21:11:31Z

This pull request fixes 1 alert when merging 2f80f33 into 88f729d - view on LGTM.com

fixed alerts:

1 for Useless assignment to local variable

Comment posted by LGTM.com

zmitton · 2019-05-06T18:13:04Z

> I'd feel more confident if there was any other client which also had proof of absence, as I'm not sure if there are no edge cases.

I made the npm package that does use all this stuff by the way

s1na

Just checked, Geth has a similar behaviour for null keys, from their documentation:

If the trie does not contain a value for key, the returned proof contains all
nodes of the longest existing prefix of the key (at least the root node), ending
with the node that proves the absence of the key.

I'm going to approve the changes. I'd appreciate it however if someone else would also review.
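
The behaviour in the Geth documentation quoted above (the proof holds the nodes of the longest existing prefix, ending with the node that shows the absence) can be sketched in a toy model. As before, the per-character JSON nodes, the SHA-256 stand-in for keccak, and this `prove` signature are assumptions for illustration and are neither Geth's nor this library's API.

```javascript
const crypto = require('crypto')

// Stand-in for keccak256 (assumption).
const hash = (buf) => crypto.createHash('sha256').update(buf).digest('hex')
const encode = (children, value) => Buffer.from(JSON.stringify({ children, value }))

// Collect every node on the path to `key`, stopping where the path ends.
function prove (db, rootHash, key) {
  const stack = []
  let ref = rootHash
  for (const ch of key) {
    const raw = db.get(ref)
    stack.push(raw)
    ref = JSON.parse(raw.toString()).children[ch]
    if (ref === undefined) return stack // last pushed node proves the absence
  }
  stack.push(db.get(ref))
  return stack
}

// Full trie: "ab" -> 1.
const leafAB = encode({}, 1)
const nodeA = encode({ b: hash(leafAB) }, null)
const root = encode({ a: hash(nodeA) }, null)
const db = new Map([root, nodeA, leafAB].map((n) => [hash(n), n]))

const present = prove(db, hash(root), 'ab') // root, nodeA, leafAB
const absent = prove(db, hash(root), 'az') // root, nodeA: the "z" slot is empty in nodeA
const noPrefix = prove(db, hash(root), 'q') // root only
```

Note that even a key sharing no prefix with the trie still yields a proof containing at least the root node, matching the "at least the root node" wording in the quote.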

holgerd77 · 2019-06-22T07:06:37Z

Ok, will merge here now. I would suggest we release this as v3.1.0?

s1na · 2019-06-24T09:50:27Z

I'll have to look deeper, but just having a quick look at the previous merged PRs I think we might have to do a v4.0.0 release, as there were some changes to the API.

A tiny change (this doesn't warrant a major release though) is #83. But the major change is #74. It drops the getRaw and other ..Raw methods, and requires passing a DB instance to the constructor.

One option (that I tend to favor) is to add a simple wrapper to baseTrie for now which calls db.get, etc. (for backwards-compatibility), and don't change how the constructor accepts the db. Then we can do a v3.1.0 release.
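
A rough sketch of what such a wrapper could look like. The `DB` class here is a hypothetical in-memory stand-in for the abstraction from #74, and `putRaw` is assumed to be one of the "other ..Raw methods"; the real signatures may differ.

```javascript
// Hypothetical stand-in for the DB abstraction introduced in #74.
class DB {
  constructor () { this.store = new Map() }
  get (key, cb) { cb(null, this.store.has(key) ? this.store.get(key) : null) }
  put (key, value, cb) { this.store.set(key, value); cb(null) }
}

class Trie {
  constructor () { this.db = new DB() }

  // Thin wrappers: keep the old method names alive by delegating to the DB.
  getRaw (key, cb) { this.db.get(key, cb) }
  putRaw (key, value, cb) { this.db.put(key, value, cb) }
}

const trie = new Trie()
let seen
trie.putRaw('k', 'v', () => trie.getRaw('k', (err, value) => { seen = value }))
```

Callers written against the old API keep working, while new code can talk to `trie.db` directly.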

Something else I suggest we change is the ES6 export syntax introduced in #71 in the util files (e.g. https://github.com/ethereumjs/merkle-patricia-tree/pull/71/files#diff-6ab8792413c5dfbbbfa72dfcdedbb5d1). I had some problems with it in the browser.

If you agree with the above suggestions I can go ahead and prepare a PR.

holgerd77 · 2019-06-24T10:03:28Z

Sounds good, would be great if you prepare a PR.

zmitton force-pushed the null-proof branch 2 times, most recently from fe5687a to fe225f9 Compare February 14, 2019 23:20

zmitton force-pushed the null-proof branch from 4a87b46 to 648372f Compare February 15, 2019 01:15

holgerd77 requested a review from s1na February 15, 2019 09:49

s1na suggested changes Feb 18, 2019

zmitton mentioned this pull request Feb 26, 2019

_lookupNode callback to use standard error, response pattern #83

Merged

zmitton force-pushed the null-proof branch from 3c0cc26 to d6671da Compare March 18, 2019 01:45

zmitton force-pushed the null-proof branch from d6671da to 73e7b6d Compare May 3, 2019 04:58

s1na reviewed May 3, 2019

Support for proofs of null/absence. Dried up prove/verify.

2f80f33

partial/sparse tree support for adding to existing sparse tree lint fixup verify root as well

zmitton force-pushed the null-proof branch from 73e7b6d to 2f80f33 Compare May 3, 2019 21:05

s1na approved these changes May 7, 2019

holgerd77 merged commit b8f612b into ethereumjs:master Jun 22, 2019

s1na mentioned this pull request Jul 23, 2019

Stateless execution prototype ethereumjs/ethereumjs-monorepo#556

Closed

holgerd77 mentioned this pull request Jul 25, 2019

Add basic multiproof generation and verification #94

Open

ryanio mentioned this pull request Apr 17, 2020

Release - v4.0.0 #111

Merged

ryanio mentioned this pull request May 7, 2020

_lookupNode is not err-first callback style #17

Closed

This was referenced Jan 19, 2021

Proof for non-existence values (0x0) #47

Closed

better proof/verification support #64

Closed

seunlanlege mentioned this pull request Dec 27, 2021

Support for proofs of abscence? paritytech/trie#147

Closed
